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SUMMARY  OF  RESULTS 


1.  For  the  thick  multilayer  plastic  F-l  1 1  windscreens  used  in  the  present  study  there  was  an  appreciable 
positive  correlation  between  lens  factor  (LF)  and  displacement  grade  (DG).  When,  for  a  windscreen, 
either  one  is  large,  the  other  also  tends  to  be  large,  and  similarly  for  intermediate  and  low  values. 

2.  There  was  no  significant  amount  of  relationship  between  either  LF  alone  or  DG  alone  and  either  the 
percentage  of  observes  rating  the  windscreens  as  acceptable  (%  A)  or  the  percentage  who  said  that 
enough  distortion  was  present  to  influence  flying  performance  (%  I). 

3.  Neither  K.  E.  Thompson’s  windscreen,  acceptability  graph  based  on  a  combination  of  LF  and  DG  nor 
the  present  author’s  “R”  criterion  ofV(LF)2  +  (DG/1,000  +  l)2  was  effective  in  sorting  windscreens 
according  to  either  Acceptability  (%  A)  or  Estimated  Influence  (%  I). 

4.  As  expected,  the  correlation  between  %  I  and  %  A  was  high,  negative  and  statistically  significant.  When 
either  is  high  for  a  windscreen,  the  other  tends  to  be  low. 

5.  There  were  high  positive  statistically  significant  correlations  between  estimated  overall  distortion  and 
subjective  estimates  of  shimmer,  grid  line  banding,  grid  line  blur  and  grid  line  bending,  but  not  with  grid 
line  splitting.  Splitting  did  not  correlate  significantly  with  any  other  optical  distortion  measure. 

6.  The  types  of  optical  distortion  that  have  a  large  affect  upon  acceptability  (%  A),  estimated  influence  (%  I) 
and  estimated  overall  optical  distortion  were  shimmer,  banding,  blurring  and  bending. 

7.  There  were  high  (r  =  .61-.93)  positive  statistically  significant  correlations  between  shimmer,  banding 
and  blur  estimates. 

8.  Grid  line  banding  ratings  correlate  +.961  and  grid  line  bending  ratings,  +.932  with  estimated  overall 
optical  distortion.  Thus,  ratings  on  either  banding  or  bending  could  be  used  to  predict  overall  optical 
distortion  ratings. 

9.  The  multiple  correlation  coefficient,  R,  between  overall  optical  distortion  ratings  and  predicted  overall 
optical  distortion  ratings  based  on  the  estimates  of  shimmer,  banding,  blurring  and  bending  is  +.9763. 

10.  An  equation  for  predicting  estimated  overall  distortion  from  estimated  shimmer,  banding,  blurring  and 
bending  did  an  excellent  job  of  prediction,  as  such  a  large  R  would  indicate.  The  percentage  error  in 
predicting  was  less  than  10%  for  9  out  of  11  windscreens:  one  windscreen  prediction  was  in  error  by  10.6% 
and  another  erred  by  17.4%. 

11.  Out  of  the  14  subjects  (or  observers)  in  the  present  study,  6  were  Pittsburgh  Plate  Glass  Co.  visual  quality 
inspectors.  When  distortion  ratings  of  the  two  groups  were  compared  it  was  found  that:  On  six  of  the 
seven  types  of  optical  distortion,  agreement  between  pilots  and  inspectors  were  quite  close.  Even  on  the 
one  exception  (grid  line  splitting),  both  groups  rated  closer  to  “Zero”  distortion,  the  lowest  category,  than 
to  “Little”  distortion.  For  all  practiced  purposes,  agreement  was  close  on  all  types  of  optical  distortion. 
The  r  between  the  two  groups  on  estimates  of  all  seven  distortions  was  +.9557. 

12.  Although  differences  between  pilots  and  inspectors  are  negligible,  the  inspectors  tended  to  rate  optical 
distortion  slightly  lower  than  did  pilots  on  all  types  of  distortion. 

13.  In  terms  of  use  of  rating  categories,  the  inspectors  tended  to  use  the  “No  Distortion”  category  signifi¬ 
cantly  more  often  than  the  pilots  did,  with  a  consequent  tendency  to  use  the  “Little  Distortion”  category 
less  often.  However,  agreement  between  pilots  and  inspectors  was  close  on  the  “Moderate”  and  “Large” 
distortion  categories,  and  the  magnitude  of  the  difference  in  the  “Extreme”  distortion  category  was 
small. 
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14.  In  terms  of  particular  windscreens,  the  average  ratings  for  the  two  groups  on  the  two  windscreens  for 
which  pilot  and  inspector  ratings  differed  the  most  were  actually  only  a  fraction  of  an  optical  distortion 
category  apart.  The  inspectors  were  not  more  lenient  (less  critical)  in  their  ratings  than  pilots  were  for 
the  higher  distortion  windscreens  for  which  rating  differences  could  have  a  practical  significance. 

15.  Pilots  were  more  critical  than  inspectors  in  judging  the  influence  on  flying  (%  I)  of  optical  distortion  and 
in  rating  windscreens  on  acceptability  (%  A).  The  differences  were  statistically  significant.  Pilots  and 
visual  quality  inspectors,  while  agreeing  fairly  closely  on  amounts  of  several  types  of  distortion  and  on 
the  overall  optical  distortion  of  specific  windscreens,  were  not  very  close  in  judging  (or  rating)  acceptabil¬ 
ity  (%  A)  and  influence  upon  flying  of  distortions  (%  I). 
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PREFACE 


This  study  was  part  of  a  joint  effort  by  the  Crew  Station  Integration  and  Technology  Development  Branches 
of  the  Human  Engineering  Division,  Air  Force  Aerospace  Medical  Research  Laboratory  (AFAMRL)  at 
Wright-Patterson  Air  Force  Base,  Ohio.  It  was  done  under  Project  7184,  Man-Machine  Integration  Technol¬ 
ogy,  Task  18,  Visual  Effects  of  Windscreens  on  Pilot  Performance,  and  Work  Unit  03,  Visual  Perception 
Through  Windscreens.  The  work  was  undertaken  in  support  of  the  Improved  Windscreen  Protection 
Development  Program,  Projects  2202  and  1926  of  the  Air  Force  Wright  Aeronautical  Laboratories,  Flight 
Dynamics  Laboratory  (AFWAL/FD)  Wright-Patterson  Air  Force  Base,  Ohio. 

The  report  analyzes  data  collected  at  the  Pittsburgh  Plate  Glass  Company  (PPG),  Creighton,  Pennsylvania. 
The  courtesy,  cooperation  and  support  rendered  by  the  management  and  personnel  of  the  company  are 
acknowledged.  The  author  also  is  thankful  for  the  services  of  the  USAF  pilots  and  the  PPG  visual  quality 
inspectors  who  made  the  study  possible.  The  assistance  of  Lt  Col  Joseph  A.  Birt  in  arranging  for  the  test 
subjects  and  securing  the  cooperation  of  the  manufacturer  is  appreciated. 

This  report  covers  one  of  the  two  studies  conducted  at  the  PPG  company  facility  using  the  same  pilots  and 
inspectors  as  subjects. 
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SECTION  I 

INTRODUCTION 


PROBLEM  AND  OBJECTIVE 

It  is  difficult  to  fabricate  aircraft  windscreens  that  do  not  contain  some  optical  distortion.  This  is  particu¬ 
larly  true  when  windscreens  are  thick,  of  multilayer  construction,  highly  curved,  and  installed  so  that  the 
user  looks  through  them  at  an  angle  departing  by  over  30°  from  normal  incidence.  All  of  these  favorable-to- 
distortion  conditions  were  present  in  some  of  the  early  F-lll  aircraft  windscreens.  A  compromise  must  be 
made  between  optical  perfection,  which  is  materially  and  economically  unfeasible,  and  severe  optical 
distortion  that  is  intolerable  to  pilots.  Some  distortion  is  acceptable.  The  Air  Force  needs  data  to  establish 
visual  distortion  tolerances  for  use  in  purchasing  windscreens.  What  amount  of  distortion  will  be  visually 
acceptable  and  what  amount  can  be  present  without  causing  more  than  a  negligible  effect  on  piloting  an 
aircraft? 


The  present  study  was  conducted  to  obtain  some  information  about  human  judgments  of  optical  distortion 
in  windscreens.  The  data  were  collected  at  a  fabrication  plant  owned  by  the  Pittsburgh  Plate  Glass  Co.  using 
11  F-lll  windscreens.  The  current  (1981)  F-lll  windscreens  are  not  made  like  the  oneB  in  this  study.  Part  of 
them  were  right-side  windscreen  panels  and  part  were  left-side  panels.  The  two  are  identicial  except  that  one 
is  “right-handed”  and  the  other  “left-handed.” 

Since  only  11  windscreens  were  available  for  distortion  testing,  and  the  testing  used  stationary  gridboards 
rather  than  moving  or  dynamic  scenes,  the  judgments  of  the  observers  will  depart  somewhat  from  ratings 
made  by  observers  of  a  real  moving  scene.  Possibly  more  of  the  windscreens  would  not  be  acceptable  in 
flight  tests,  especially  if  observation  times  were  extended,  than  would  be  unacceptable  in  static  tests.  Due  to 
the  small  sample  size  and  the  static  testing,  the  present  study  is  concerned  largely  with  the  interrelation¬ 
ships  of  distortion  measures  and  comparisons  of  the  distortion  ratings  of  pilots  and  visual  quality 
inspectors. 

WINDSCREEN  DISTORTION 

Optical  distortion  in  aircraft  windscreens  can,  from  the  viewpoint  of  an  observer,  take  many  different  forms 
or  modes  of  appearance.  For  example,  the  lines  of  a  grid  board,  when  observed  through  an  aircraft 
windscreen,  may  appear  to  split,  to  bend,  to  blur,  to  shimmer  when  the  observer's  head  is  moved,  to  show 
banding  effects,  to  exhibit  “bulls  eyes”  due  to  localized  magnification,  etc.  These  effects  are  due  to  variations 
in  the  thickness  of  materials  and  in  the  surface  configurations.  The  windscreen  acts  as  though  it  contains 
surface  “waves”,  spherical,  cylindrical  and  aspheric  lenses  of  low  refractive  power,  prisms  of  low  power,  etc. 
The  distortions  are  refractive  or  light  bending  phenomena. 

Clearly,  optical  distortion,  as  found  in  aircraft  windscreens,  is  a  complex  of  variables  rather  than  a  single 
variable.  Unfortunately,  the  technology  of  windscreen  optical  distortion  measurement  is  not  well  developed. 
Much  work  is  being  done  in  various  laboratories,  using  a  wide  variety  of  approaches,  to  advance  the  state  of 
the  art  At  the  present  time  there  are  several  objective  measures  of  optical  distortion  in  aircraft  windscreens. 
Two  that  were  used  on  F-lll  windscreens  are  lens  factor  (LF)  and  displacement  grade  (DG).  LF  is  a  measure 
of  lensing  or  magnification.  It  is  the  cube  of  the  size  increase  in  grid  squares  per  inch  on  a  photograph  of  a 
grid  pattern  (“grid  board”)  taken  through  the  windscreen  with  the  camera  located  at  the  pilot’s  design  eye 
position.  One  thousand  times  the  sum  of  the  greatest  horizontal  and  vertical  displacements  of  grid  lines 
from  the  same  photograph  in  the  windscreen  area  with  the  largest  lens  factor  is  termed  displacement  grade. 

We  gave  Mr.  K.  E.  Thompson  of  General  Dynamics  the  problem  of  making  optical  measurements  of 
windscreens  that  would  allow  his  company  to  detect  windscreens  having  optical  distortions  that  would 


•  Thompson  K.  E.,  Optical  Evaluation  of  the  F-lll  Windscreen,  FZM-12-10962,  Feb.  12, 1970.  This  is  an 
internal  publication  of  the  Fort  Worth  division  of  G  leral  Dynamics  Corporation. 


result  in  rejection  of  the  windscreens  by  aircraft  pilots.  In  1969*  he  showed  that  LF  and  DG,  in  combination, 
could  be  used  to  divide  windscreens  into  three  groups:  acceptable  to  pilots,  questionable,  and  not  acceptable 
to  pilots.  V  ery  few,  only  eight,  of  the  65  F-  111  windscreens  that  he  used  fell  into  the  questionable  or  gray  area 
of  his  graph  of  LF  versus  DG.  The  questionable  ones  required  visual  examination  to  determine  acceptability 
to  aircraft  pilots.  Thompson’s  thin  glass  F-lll  windscreens  differed  both  mechanically  and  optically  from 
the  11  thick  multilayer  plastic  ones  of  the  present  study.  Measurements  on  Thompson’s  acceptability  graph 
yielded  LF  and  DG  values  which  are  listed  in  table  1.  The  table  also  gives  the  calculated  values  of  R,  an 
LF-DG  composite  derived  by  the  author  of  the  present  report  to  do  the  same  task  of  acceptability-to-pilots 
discrimination  as  Thompson’s  acceptability  graph.  Numerically  R^LF)2  +  (DG/1,000  +  l)2.  Note  from  the 
table  that  the  values  of  R,  listed  in  order  of  increasing  numerical  values  may  be  divided  into  three  areas. 
They  are  separated  in  the  table.  They  are:  an  acceptable  (+)  range,  R  =  0-1.682,  where  all  windscreens  are 
acceptable;  a  questionable  or  gray  area,  R  =  1.685-1.725,  where  some  are  acceptable  and  some  are  not 
acceptable  (— );  and  an  unacceptable  area  or  region,  R  >  1. 726,  where  all  windscreens  are  unacceptable.  Eight 
of  Thompson’s  65  windscreens  are  in  the  gray  area  in  the  table.  One  of  the  tasks  of  the  present  report  is  to 
examine  the  suitability  of  R  for  use  with  the  11  F-lll  windscreens  examined  at  the  Creighton  PPG  plant 
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TABLE  1 

LENS  FACTOR,  DISPLACEMENT  GRADE,  THEIR  COMPOSITE, 
AND  THE  ACCEPTABILITY  TO  PILOTS  OF  65  F-lll  WINDSCREENS 
(K.  E.  THOMPSON  DATA*) 


NO 

LF 

DG 

R 

nr 

n 

LF 

DG 

R 

B 

£3 

LF 

DG 

R 

A 

1 

1.041 

71.2 

1.493 

+ 

24 

1.197 

90.0 

1.619 

+ 

43 

1.335 

128.2 

1.748 

- 

2 

1.047 

80.0 

1.504 

+ 

23 

1.150 

150.5 

1.627 

+ 

46 

1.329 

140.8 

1.751 

- 

3 

1.047 

90.5 

1.512 

+ 

25 

1.197 

110.0 

1.632 

+ 

47 

1.331 

150.5 

1.759 

- 

4 

1.048 

99.5 

1.519 

+ 

26 

1.208 

110.0 

1.641 

+ 

48 

1.329 

160.0 

1.764 

- 

6 

1.038 

109.8 

1.520 

+ 

27 

1.200 

120.5 

1.642 

+ 

49 

1.331 

180.0 

1.779 

- 

7 

1.047 

109.8 

1.526 

+ 

28 

1.209 

120.0 

1.648 

+ 

50 

1.341 

190.0 

1.793 

- 

5 

1.058 

99.5 

1.527 

+ 

29 

1.208 

129.5 

1.654 

+ 

51 

1.347 

192.5 

1.799 

- 

8 

1.050 

120.0 

1.535 

+ 

30 

1.208 

140.0 

1.661 

4* 

52 

1.400 

140.0 

1.805 

- 

9 

1.056 

118.0 

1.538 

+ 

31 

1.208 

170.5 

1.682 

+ 

53 

1.400 

152.9 

1.818 

- 

10 

1.053 

124.2 

1.540 

+ 

33 

1.269 

110.8 

1.685 

- 

58 

1.329 

251.5 

1.826 

- 

13 

1.100 

90.8 

1.549 

+ 

34 

1.277 

120.0 

1.699 

+ 

54 

1.408 

170.8 

1.831 

- 

11 

1.053 

140.0 

1.552 

+ 

35 

1.269 

130.0 

1.699 

+ 

55 

1.400 

190.0 

1.837 

- 

14 

1.093 

106.0 

1.555 

+ 

32 

1.197 

211.0 

1.703 

- 

57 

1.397 

210.8 

1.848 

- 

12 

1.058 

142.0 

1.557 

+ 

36 

1.259 

149.5 

1.705 

+ 

59 

1.472 

150.0 

1.868 

- 

16 

1.100 

109.5 

1.562 

+ 

37 

1.269 

149.8 

1.712 

- 

60 

1.471 

170.0 

1.880 

- 

15 

1.103 

108.0 

1.563 

+ 

38 

1.269 

160.0 

1.719 

- 

56 

1.460 

206.8 

1.894 

- 

17 

1.100 

120.0 

1.570 

+ 

39 

1.277 

160.0 

1.725 

+ 

62 

1.500 

180.5 

1.909 

- 

18 

1.150 

100.0 

1.591 

+ 

40 

1.330 

100.0 

1.726 

- 

61 

1.480 

211.2 

1.912 

- 

19 

1.150 

110.8 

1.599 

+ 

41 

1.321 

120.0 

1.732 

- 

63 

1.550 

221.5 

1.973 

- 

20 

1.150 

120.0 

1.605 

+ 

44 

1.324 

132.5 

1.742 

- 

64 

1.620 

294.8 

2.074 

- 

21 

1.153 

120.0 

1.607 

+ 

42 

1.338 

117.8 

1.743 

- 

65 

1.819 

232.2 

2.197 

22 

1.150 

130.0 

1.612 

+ 

45 

1.329 

132.2 

1.746 

- 

*Taken  from  measurements  made  on  Thompson's  graph. 

Key: 

NO  =  Windscreen  Number  on  Graph;  LF  =  Lens  Factor: 

DC-  =  Displacement  Grade;  A  =  Acceptability  to  Filots;  j  *"  ■  ■  — ■  —  -  -  -r 

R  *  LF  and  DG  Composite  *  Vector  Length  or  Distance  =  ^LF)^  +  (DG/1,000  +  1) 
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SECTION  II 

OBSERVERS,  INSTRUCTIONS  AND  TESTING 

PARTICIPATING  OBSERVERS 

All  of  the  people  who  served  as  observers  and  raters  in  this  study  had  already  spent  several  hours  in  making 
judgments  about  the  various  defects  present  in  the  11  windscreens  that  were  to  be  examined  and  rated.  They 
had  all  been  instructed  not  to  discuss  their  ratings  or  observations  with  other  observers.  Six  of  the  observers 
(or  subjects)  had  had  considerable  experience  with  judging  the  visual  quality  of  aircraft  windscreens.  This 
group  will  be  referred  to  as  visual  quality  inspectors  (VQI).  It  included  five  Pittsburgh  Plate  Glass  visual 
quality  inspectors  and  one  Air  Force  Major  with  several  years  of  experience  in  the  laboratory  and  in  the  field 
with  windscreen  inspection.  Six  of  the  observers  were  Air  Force  pilots  all  of  whom  had  several  years  of 
flying  experience.  The  two  remaining  observers  were  familiar  with  aircraft  windscreen  problems,  but  were 
neither  pilots  nor  visual  quality  inspectors. 

INSTRUCTIONS  TO  OBSERVERS 

Observers  were  told  that  they  would  be  asked  to  make  judgments  about  11  F-lll  windscreen  panels,  some  left 
or  pilot’s  panels,  and  some  right  or  copilot’s  panels.  They  were  to  stand  on  a  bench  and  look  through  each 
panel  at  a  well-illuminated  grid  board  pattern.  The  grid  board  consisted  of  tightly  stretched  white  strings 
just  in  front  of  a  black  panel,  giving  white  lines  on  a  black  background.  The  string  formed  1-inch  squares. 
Observers  were  permitted  to  move  their  heads  in  any  way  that  they  desired  while  making  ratings.  They  were 
told  that  there  was  no  time  limit,  and  that  they  were  not  to  discuss  their  observations  or  ratings  with  other 
participants.  Each  windscreen  was  rated  and  the  rating  sheet  was  handed  in  before  the  next  windscreen 
was  seen.  The  identification  of  individual  windscreen  panels  was  concealed  by  code  letters  so  that  raters 
would  not  remember  previous  ratings  or  judgments  made  on  specific  windscreens. 

THE  DISTORTION  RATING  SHEETS 

As  each  observer  stepped  up  to  a  windscreen  to  rate  it  on  distortion  he  was  handed  a  score  sheet  containing 
questions  and  blank  spaces  for  writing  down  or  checking  his  responses.  The  score  sheets  for  all  11 
windscreens  were  identical.  Figure  1  is  a  sample  blank  score  sheet.  Note  that  the  first  question  asked  the 
observer  if  there  was  enough  distortion  in  the  windscreen  to  affect  flying  performance.  The  second  asked  if 
the  windscreen  was  acceptable.  Question  3  had  the  observer  rate  the  windscreen  on  eight  distortion 
variables:  line  splitting,  line  bending,  line  blurring,  banding,  shimmer,  magnification  (bulls  eyes),  other, 
and  overall  distortion.  There  were  five  checklist  categories  of  amount  of  distortion  observed:  none  (0),  little 
(1),  moderate  (2),  large  (3),  and  extreme  (4).  The  observer  checked  the  space  indicating  the  appropriate 
amount.  A  space  was  also  provided  for  indicating  the  location  of  the  distortion  on  a  standard  diagram  of  the 
windscreen. 

The  last  question  asked  the  observer,  if  he  rejected  the  windscreen,  to  indicate,  relevant  to  the  standard 
diagram,  the  location  of  the  area  that  had  such  bad  distortion  as  to  cause  windscreen  rejection. 
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Part  Identification 


Name 


1.  Would  you  say  that  there  is  enough  distortion  in  this  part  to  affect  flying 
performance? 

Yes  _  No  _ 

2.  In  your  opinion,  is  this  part  acceptable? 

Yes  _  No  _ 

3.  The  following  optical  defects  are  components  of  distortion.  Please  rate  the 
part  on  these  defects  by  checking  the  scale  below  CO  thru  4).  Also,  if  a 
moderate  amount  or  more  is  checked  (category  2  or  higher),  indicate  in  the 
last  column  the  area  or  areas  (from  the  diagram  provided)  where  the  defect 
was  observed. 


A. 


B. 


C. 


D. 


E. 


F. 


G. 


H. 


Line  Split 

Line  Bending 

Line  Blurring 

Banding 

Shimmer 

Magnification 
(Bulls  Eyes) 

Other 

Overall  Distor¬ 
tion 


Amount  Observed 

None 

Little 

Moderate 

Large 

Extreme 

Diagram 

Locations 

4.  If  you  rejected  the  part  in  question  "2"  above,  what  defect  (A),  in  what  area 
(where),  on  the  diagram,  caused  your  rejection? 


Figure  1  The  score  sheet  format. 
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SECTION  III 
RESULTS 

LENS  FACTOR  AND  DISPLACEMENT  GRADE 

Table  2,  under  the  column  heading  “Objective  Measures,”  lists  the  lens  factor  (LF)  and  displacement  grade 
(DG)  of  the  11  multilayer  plastic  F-lll  windscreens.  Figure  2  is  a  plot  of  LF  against  DG.  The  plot  includes 
Thompson’s  area  of  probable  acceptance,  area  requiring  subjective  evaluation,  and  area  of  probable 
rejection.  Note  that  windscreen  “M”  is  atypical:  it  is  the  only  one  of  the  11  with  a  high  displacement  grade 
combined  with  a  low  lens  factor.  The  figure  gives  the  least-squares  best  fit  linear  equation  relating  LF  and 
DG  for  the  11  windscreens:  DG  =  131  LF  —  28.6.  The  data  point  “M”  was  not  used  in  the  derivation. 

The  Pearson  product-moment  correlation  coefficient,  r,  between  LF  and  DG  was  +.6311  for  all  11  windscreens 
and  +.8752  when  atypical  windscreen  “M”  was  excluded.  Both  of  these  r  values  are  statistically  significant 
at  the  .05  level  of  significance.  There  is  an  appreciable  tendency  for  either  value  to  be  high  when  the  other  is 
high,  and  similarly  for  medium  and  for  low  values. 

The  fairly  high  positive  LF-DG  correlation  is  not  surprising.  The  author  had  previously  measured,  from  a 
graph  in  Thompson’s  report,  the  LF  and  DG  values  of  the  65  glass  F-lll  windscreens  used  by  K.  E. 
Thompson  in  1969  at  General  Dynamics  and  calculated  an  r  of  +.7554  between  LF  and  DG.  This  large 
positive  value  is  statistically  significant  at  the  .01  level  of  statistical  significance.  Thus,  LF  and  DG  vary 
together  in  both  glass  and  plastic  F-lll  windscreens. 

The  first  question  on  the  observer’s  rating  sheet  asked  if  there  was  enough  distortion  in  the  windscreen  to 
affect  flying  performance.  The  second  question  asked  if  the  windscreen  was  acceptable  (for  use  on  an  F-lll 
aircraft).  Results  are  shown  in  table  2.  Figure  3  plots,  against  lens  factor,  the  percentage  of  observers  (%  I) 
who  say  that  the  distortion  is  enough  to  influence  flying  performance.  From  the  figure  it  is  clear  that  %  I  is 
unrelated  to  lens  factor.  The  correlation  coefficient,  r,  between  LF  and  %  I  is  only  -.1625,  too  low  to  be 
statistically  significant.  Even  when  windscreen  “M”  is  excluded,  r  is  only  .-1735,  also  too  small  to  be 
statistically  significant.  In  either  case,  the  amount  of  relationship  was  no  more  than  would  be  expected  by 
chance. 

Figure  4  plots,  against  displacement  grade,  the  percentage  of  observers  who  believe  that  enough  distortion 
is  present  to  influence  flying  performance  (%  I).  From  the  figure  it  is  clear  that  %  I  is  not  related  to,  or 
predictable  from,  DG.  This  is  confirmed  by  the  r  between  %  I  and  DG  of  only  +.3879  for  all  11  windscreens  and 
only  -.1737  when  windscreen  “M”  is  excluded.  Neither  value  is  statistically  significant. 

Results  for  the  second  question  on  acceptability  were  similar  to  the  results  for  influence  as  far  as  LF  and  DG 
are  concerned.  The  r  for  LF  -  %  A  was  only  +.1899  for  all  11  windscreens  and  +.1423  for  all  11  and  +.1899  with 
“M”  excluded.  These  values  are  all  too  low  to  be  statistically  significant:  no  relationship  has  been  shown  to 
exist  between  DG  and  acceptability. 

Clearly,  neither  lens  factor  by  itself  nor  displacement  grade  by  itself  was  related  to  either  the  percentage  of 
observers  accepting  the  windscreens  (%  A)  or  to  the  percentage  who  expected  effects  upon  flying  perfor¬ 
mance  (%  I).  As  mentioned  earlier,  Thompson  had  found,  in  a  study  of  65  glass  F-lll  windscreens,  that  a 
combination  of  LF  and  DG  was  efficient  in  sorting  windscreens  into  those  acceptable  to  and  those  not 
acceptable  to  pilots.  Earlier  in  the  present  paper  a  quantity  R,  defined  as  R^KLF)2  +  (DG/1,000  +  1)*  was 
shown  to  do  essentially  the  same  sorting  as  Thompson’s  acceptability  graph.  For  the  11  windscreens  of  the 
present  study,  the  correlation  coefficient  (r)  of  %  I  with  R  was  trivial  in  magnitude  (-.083)  and  statistically 
nonsignificant,  as  Y  based  on  T/N  arc  sin  transform  (+.131).  Small  and  statistically  nonsignificant  r’s  were 
also  found  with  windscreen  “M”  excluded.  Clearly,  LF  and  DG  in  combination,  using  the  “R”  formulation, 
were  unrelated  to  either  the  acceptability  or  the  judged  influence  upon  flying  of  the  11  windscreens. 

The  numbers  beneath  each  data  point  in  figure  2  give,  for  each  windscreen,  the  percentage  of  observers  who 
found  it  acceptable  (%  A)  and  the  percentage  who  believed  that  enough  distortion  was  present  to  influence 
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DISTORTION  DATA  FOR  11  F-lll  WINDSCREENS 
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Figure  3.  %  I  plotted  against  displacement  grade. 
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Figure  4.  %  A  plotted  against  lens  factor. 


flying  performance  (%  I).  This  graph’s  areas  of  acceptable,  questionable  (gray)  and  unacceptable  are  those  of 
Thompson.  Only  one  of  the  11  windscreens  was  in  the  gray  area  (and  was  unacceptable),  while  the  two  in  the 
area  of  probable  rejection  were  both  judged  as  being  acceptable.  All  windscreens  in  the  area  of  probable 
acceptance  were  found  to  be  acceptable.  For  the  11  F-lll  windscreens  that  were  exclaimed  it  must  be 
concluded  that,  like  the  present  author’s  “R,”  Thompson’s  criterion  which  was  also  based  on  LF  and  DG  was 
not  useful  for  eliminating  windscreens  that  would  be  optically  unacceptable.  It  is  not  unlikely  that  the  11 
windscreens  that  were  tested  were  preselected  by  the  manufacturer  to  eliminate  windscreens  that  would  be 
unacceptable  from  the  viewpoint  of  distortion.  Even  so,  doubt  is  thrown  upon  the  idea  that  windscreens  not 
passing  inspection  by  Thompson’s  criteria  are  unsuitable  for  installation  in  aircraft.  More  data  are  needed, 
but  it  is  likely  that  all  windscreens  falling  into  either  his  gray  area  or  his  area  of  probable  rejection  will 
require  subjective  visual  evaluation.  Falling  into  the  area  of  probable  rejection,  by  itself,  is  not  sufficient 
cause  for  rejection. 

Another  inference  that  can  be  drawn  from  figure  2  is  that  one  should  not  expect  to  find  significant 
correlations  between  %  A  and  either  LF  or  DG,  nor  between  %  I  and  either  LF  and  DG.  When  these  four 
correlations  were  calculated,  none  was  large  enough  to  attain  statistical  significance.  This  is  shown  in 
tables  3,  4  and  5. 

Displacement  guide  is  correlated  positively  and  significantly  with  lens  factor,  as  was  mentioned  earlier,  in 
F-lll  windscreens. 

ESTIMATED  INFLUENCE  AND  ESTIMATED  ACCEPTABILITY 

It  is  to  be  expected  that  as  the  percentage  of  observers  who  think  that  enough  distortion  is  present  to 
influence  performance  (%  I)  increases,  %  A  would  decrease.  The  r  between  influence  and  acceptability  should 
be  high  and  negative.  The  correlation  tables  verify  this  expectation,  with  r  =  -.8916  for  all  11  windscreens  and 
r  =  -.7515  for  all  except  windscreen  “M.”  The  r  values  are  not  changed  much  when  the  data  are  arch  sin 
transformed,  the  usual  data  transformation  for  percentage  data,  to  make  the  distributions  more  nearly 
Gaussian  for  statistical  analyses. 

ESTIMATED  INFLUENCE  AND  DISTORTION  ESTIMATES 

It  is  to  be  expected  that  %  I  would  be  positively  correlated  with  some  of  the  distortion  estimates,  and  the 
correlation  tables  bear  out  this  expectation.  Statistically  significant  positive  r’s  are  found,  when  using  all  11 
windscreens  and  when  omitting  the  data  for  windscreen  “M,”  between  judged  influence  on  pilot  perfor¬ 
mance  (%  I)  and  the  overall  distortion  estimate,  grid  line  bending,  grid  line  blur,  banding,  and  shimmer. 
Some  of  the  t  values  are  almost  .90  and  some  are  even  higher.  N  o  significant  relationship  is  found  between  % 
I  and  either  magnification  (or  bulls  eyes)  or  line  splitting.  The  range  of  values  of  both  magnification  and  of 
line  splitting  covered  by  the  11  F-lll  windscreens  was  quite  small,  as  no  windscreen  had  much  of  either  one. 
With  a  restricted  range  of  values,  only  small  values  of  r  between  either  of  these  two  variables  and  other 
variables  would  be  expected. 

ACCEPTANCE  AND  DISTORTION  ESTIMATES 

As  expected,  statistically  significant  correlation  coefficients  are  found,  using  all  11  windscreens,  between  % 
A  and  the  same  variables  that  were  significantly  correlated  with  %  I.  Actual  numerical  values  were 
somewhat  lower.  %  A,  as  was  the  case  with  %  I,  did  not  correlate  significantly  with  estimated  magnification 
nor  with  grid  board  line  splitting.  However,  with  windscreen  “M”  omitted,  only  the  estimated  amount  of 
banding  is  significantly  correlated  with  the  percentage  of  observers  accepting  each  windshield  (96  A). 

OVERALL  DISTORTION  AND  DISTORTION  ESTIMATES 

One  would  expect  that  an  observer’s  estimate  of  the  overall  distortion  in  a  windscreen  would  be  a  mental 
summation  of  the  various  types  of  distortion  that  were  present  Thus,  statistically  significant  r  values  with 
one  or  more  of  the  various  types  Of  distortion  would  be  expected.  In  the  correlation  table  for  all  11 
windscreens,  there  are  high  positive  statistically  significant  r’s  between  the  overall  distortion  estimate  and 
shimmer,  banding,  grid  line  blur  and  grid  line  bending.  A  low  but  statistically  significant r  is  found  between 
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The  multiple  correlation  coefficient  between  estimated  overall  distortion  and  shimmer,  banding,  blurring,  bending 
and  magnification  is  +.8188. 
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The  r  between  %I  and  %A  when  both  are  arc  sin  transfored  is  also  -.7085. 

The  multiple  correlation  coefficient  between  estimated  overall  distortion  and  shimmer,  banding,  blurring, 
bending  and  magnification  is  +.4848. 


TABLE  S 


100X  (COEFFICIENT  OF  DETERMINATION)  FOR  STATISTICALLY 
SIGNIFICANT  CORRELATION  COEFFICIENTS 
(A)  All  eleven  F-lll  windscreens 


Measure 

R 

DG 

LG 

Mag. 

Shim. 

Band 

Blur 

Bend 

Split 

Dist* 

XA 

Influence  *  XI 

“Hi 

XI  Arc  Sin  x' Formed 

■1 

■*■1 

Acceptance  *  XA 

■ m 

“M 

*1 

XA  Arc  Sin  X' Formed 

rz 

Overall  Distortion 

Line  Splitting 

•m 

Line  Bending 

ima 

•m 

Line  Blurring 

'i 

Banding 

l“fli 

Shimmer 

*1 

Magnification 

Lens  Factor  (LF) 

40 

■ _ 

|  Displacement  GradefDOlm^P 

(B)  All  windscreens  except  windscreen  "m" 


Measure 

R 

DG 

LF 

Mag. 

Shim 

Band 

Blur 

Bend 

Split 

Dist* 

_%A 

Influence  =  XI 

“I 

mm 

"1 

"I 

XI  Arc  Sin  X' Formed 

2E 

a= 

a: 

“I 

Acceptance  ”  XA 

"m 

XA  Arc  Sin  X'Formed 

Overall  Distortion 

■"I 

Line  Splitting 

mgm 

Line  Bending 

Line  Blurring 

rm 

Banding 

■ 

Shimmer 

®1 

“’ll 

B 

Magnification 

1  Displacement  Grade  (DG)  LJ&i  1 

t  Coefficients  of  determination  are  the  squares  of  the  correlation  coefficients.  When 
multiplied  by  100,  they  represent  the  percentage  of  the  total  variance  In  one  variable 
that  Is  associated  with  or  predictable  from  the  variation  In  the  other  variable.  A 
value  of  100  means  that  one  variable  is  predictable  with  zero  error  from  the  other 
variable. 

*  Dlst.  *  overall  distortion  estimate. 
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overall  distortion  and  magnification,  but  no  significant  r  is  found  with  grid  line  splitting.  According  to 
observer  ratings,  windscreen  “M”  contains  the  most  magnification.  With  the  data  from  “M”  excluded,  the  r 
between  estimated  overall  distortion  and  either  shimmer  or  magnification  is  not  statistically  significant. 

SOME  RELATIONSHIPS  BETWEEN  TYPES  OF  DISTORTION 

The  correlation  tables  give  the  amount  of  covariation  or  correlation  between  observer  ratings,  as  well  as 
between  the  objective  measures  of  LF,  DG,  and  their  composite,  R.  Several  of  these  relationships  have  been 
discussed,  and  it  was  noted  that  grid  board  line  splitting  did  not  correlate  significantly  with  any  other 
measure,  while  magnification  estimates  had  correlations  that  were  low,  though  they  were  statistically 
significant  in  some  cases.  Overall  distortion  has  already  been  discussed.  This  leaves  us  with  the  question: 
What  types  of  distortion  have  a  large  impact  on  %  A,  %  I  and  overall  distortion  other  than  these  three 
themselves?  For  all  11  F-lll  windscreens  these  distortion  types  are  shimmer,  banding,  blurring  and 
bending,  with  shimmer  excluded  when  windscreen  “M”  is  excluded.  A  question  of  some  interest  is:  “How 
closely  do  these  four  types  of  distortion  relate  to  each  other?”  Table  6  below,  for  all  11  windscreens,  extracts 
the  product  moment  correlation  coefficients  from  the  larger  correlation  table,  table  3. 

TABLE  6 

CORRELATIONS  BETWEEN  DISTORTION  TYPES 
FOR  ALL  ELEVEN  F-lll  WINDSCREENS 


Shimmer  Banding  Blurring 

Bending 

Blurring 

Banding 

+.7135  +755 Vo  +.8694 — " 

+.6050  +.8169 

+.7445 

Note:  All  tabled  r  values  are  statistically  signif¬ 
icant  at  the  ,05  or  higher  level  of  significance. 


From  the  table  it  is  clear  that:  (1)  all  correlations  are  positive,  as  one  might  expect,  (2)  all  are  statistically 
significant,  i.e.,  the  relationship  is  higher  than  can  reasonably  be  attributable  to  chance,  (3)  all  are  fairly 
large  in  magnitude,  with  the  r’s  between  shimmer  and  the  other  three  being  the  lowest.  Even  the  blurring- 
shimmer  r,  the  lowest,  is  +.6050,  accounting  for  36%  (or  100r2)  of  variance  in  prediction. 

As  indicated  in  the  previous  paragraph,  the  overall  distortion  rating  or  estimate  by  observers  is  correlated 
with  (or  related  to)  ratings  on  shimmer,  banding,  blurring  and  grid  line  bending.  It  may  be  noted  from  table  3 
for  all  11  windscreens  that  banding  ratings  correlate  +.9608  and  bending  ratings  +.9317  with  observer 
estimates  of  overall  distortion.  Thus,  either  banding  or  bending  ratings  can  account  for  most  of  the  variance 
in  overall  distortion.  In  other  words,  one  could  use  the  mean  rating  on  either  of  these  two  variables  for  any 
given  windscreen  to  do  a  fair  job  of  predicting  the  overall  distortion  rating  for  the  windscreen.  The 
prediction  efficiency  of  shimmer  or  blur  ratings  for  predicting  the  overall  distortion  rating  for  any  of  the  11 
windscreens  is  not  low,  but  is  lower  than  for  bending  or  banding  estimates:  the  r  for  shimmer-overall 
distortion  is  +.7893  and  the  r  for  blur-overall  distortion  is  +.8509. 

The  multiple  correlation  coefficient  for  overall  distortion  rating  is  defined  as  the  correlation  between  the  11 
average  values  of  overall  distortion  estimates  for  the  11  windscreens  and  the  best  estimate  (or  prediction)  of 
overall  distortion  rating  for  each  of  the  11  based  on  the  average  ratings  on  several  types  of  distortion. 
Multiple  R  is  +.9881,  a  very  high  and  statistically  significant  value,  between  overall  distortion  ratings  and 
predicted  overall  ratings  based  on  ratings  on  shimmer,  banding,  blurring  and  bending.  The  square  of  R  is 
.9763,  indicating  that  98%  of  the  variance  in  the  overall  distortion  rating  can  be  accounted  for  by  the  ratings 
on  thesefour  variables.  The  predicted  average  distortion  rating  for  any  one  of  the  11  windscreens  is  given  by 
the  equation:  Yi  =  .2379  (Shimmer)i  +  .6319  (BandingH  +  .3130  (BlurringH  +  .1886  (BendingK  -  .0743.  The 
values  used  in  the  parentheses  are  the  average  values  for  the  particular  windscreen,  i.  It  is  instructive  to 
apply  the  equation  to  the  11  windscreens  to  see  how  well  it  predicts  for  individual  windscreens.  This  is  done 
in  table  7. 
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TABLE  7 


OBTAINED  AND  PREDICTED  OVERALL  DISTORTION  RATINGS 


Overall  Distortion 

W 

I  N  D 

s  c 

R.  E  E  N 

Rating 

A 

B 

C 

D 

E 

F 

G 

H 

J 

L 

M 

Obtained  . 

1.36 

1.36 

1.54 

1.29 

i.ii 

1.29 

1.86 

.786 

2.00 

1.21 

2.f>4 

Predicted 

1.25 

1.45 

1.48 

1.35 

1.10 

1.38 

1.88 

.781 

1.84 

1.42 

2.64 

%  Error* 

+8.1 

-6.6 

+1.3 

-4.7  +10.6 

-7.0 

-1.1 

+6.4 

+8.0 

-17.4 

0 

*%  Error  *  (obtained  -  Predicted)  x  100  7  (Obtained) 

+  The  predicted  value  is  based  on  a  combination  of  shimmer,  banding,  blurring  and 


bending,  using  the  prediction  formula  given  in  the  text. 

Note  from  the  table  that  the  percentage  error  in  predicting  overall  distortion  rating  from  a  combination  of 
raings  on  four  other  variables  is  small  for  most  windscreens:  the  two  largest  errors  are  17.4%  for  windscreen 
“L”  and  10.6%  for  windscreen  “E.”  The  r  value  between  the  obtained  and  predicted  values  in  the  table  is 
>.9749  and  would  be  >.9891  had  the  values  not  been  rounded  off  to  3  digits.  This  represents  excellent 
predictivity. 

Magnification  ratings  do  not,  as  indicated  earlier,  have  a  high  correlation  with  overall  distortion  ratings. 
Even  so,  a  multivariate  analysis  was  conducted  with  magnification  ratings  added  to  those  for  bending, 
blurring,  banding  and  shimmer.  Multiple  R  for  all  11  windscreens  was  .9765,  accounting  for  95.4%  of  the 
variance  in  predicting  overall  distortion  ratings.  Multiple  R  for  all  windscreens  except  windscreen  M  was 
.9443,  accounting  for  89.1%  of  the  variance  in  predicting  overall  distortion  ratings.  In  both  the  10  and  11 
windscreen  cases  the  regression  coefficient  for  magnification  was  small  and  negative.  Adding  magnifica¬ 
tion  ratings  to  the  prediction  equation  did  not  have  much  effect  upon  its  prediction  efficiency.  As  a  matter  of 
interest,  the  prediction  equations  using  all  five  independent  variables  are  as  follows  for  all  11  windscreens 
and  for  only  10  (“M”  excluded): 

Y  Overall  (11)  =  .32063  (Shimmer)  >  .72383  (Banding)  >  .27942  (Blurring) 

+.10848  (Bending)  -.09101  (Mag.)  -.07977 

Y  Overall  (10)  =  .33973  (Shimmer)  >.72888  (Banding)  >.26335  (Blurring) 

>.11436  (Bending)  -.09055  (Mag.)  -.09603 

The  relative  contributions  to  the  overall  distortion  prediction  of  the  various  types  of  distortion  is  best  seen 
from  examining  the  coefficients  of  the  various  distortion  estimates  when  the  equations  are  in  standard 
form,  i.e.,  relative  deviation  scores  rather  than  raw  scores  (or  ratings  directly  from  the  rating  scale)  are  used. 
Table  8  gives  these  coefficients,  which  are  known  as  standard  partial  regression  coefficients,  for  the 
independent  (or  predictor)  variables. 

Table  8  shows  (1)  banding  ratings  are  the  most  predictive  of  overall  distortion  ratings;  (2)  blurring  and 
shimmer  ratings  carry  considerably  less  weight  than  banding,  but  are  comparable  to  each  other;  (3)  the 
contribution  from  bending  ratings  in  predicting  overall  distortion  ratings  is  quite  small;  (4)  the  importance 
or  weight  of  magnification  ratings  is  not  negligible,  and  exceeds  that  of  bending  ratings,  and  (5)  the 
magnification  coefficients  are  negative,  indicating  that  they  have  a  suppressant  effect  upon  the  contribu¬ 
tion  of  the  other  variables. 

TABLES 


STANDARD  PARTIAL  REGRESSION  COEFFICIENTS  FOR 
PREDICTING  OVERALL  WINDSCREEN  DISTORTION 


v* 

n** 

Shlamer 

Blurring 

Bending 

Magnification 

4 

11 

.1^09) 

.59^04 

.15^8 

.iJWRT 

5 

11 

.48837 

.69598 

.30266 

. 10304 

-.25547 

5 

10 

.23853 

.66522 

.18825 

.10206 

-.25531 

V*  -  number  of  variables  used  to  predic>  distortion  rating" 
n**  “  number  of  windscreens  used 
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DISTORTION  RATINGS  OF  PILOTS  AND  QUALITY  INSPECTORS 


The  data  discussed  up  to  this  point  have  been  for  all  14  observers.  The  observers  included  six  pilots  with 
extensive  flying  experience  and  six  visual  quality  inspectors  (V QI).  The  obvious  question  is:  “How  closely  do 
the  ratings  of  the  two  groups  compare?”  Will  windscreens  found  acceptable  by  VQI  be  acceptable  to  pilots? 
Ideally,  this  would  be  answered  by  comparing  ratings  made  by  VQI’s  in  the  factory  with  ratings  made  in  the 
field  by  pilots  flying  aircraft  outfitted  with  the  windscreens  in  question.  This  type  of  data  is  not  available,  so 
what  will  be  compared  is  ratings  made  by  the  two  groups  in  the  factory. 

The  average  ratings  of  the  pilot  and  inspector  groups  for  each  of  the  11  windscreens  is  given  in  table  9.  The 
means  or  averages  for  each  type  of  distortion  are  given  at  the  bottoms  of  the  columns  and  may  be  seen  in  a 
graphic  form  in  figure  5.  Figure  5  shows:  (1)  Six  of  the  7  ratings  are  in  very  close  agreement.  (2)  The  only  one 
not  numerically  in  close  agreement,  line  splitting,  is  closer  to  zero  distortion  than  to  “Little”  distortion  for 
both  groups,  so  that  even  here,  for  all  practical  purposes,  agreement  is  close.  (3)  Visual  quality  inspectors 
rate  distortion  slightly  lower  than  do  pilots  on  all  types  of  distortion,  but  the  differences  between  the  groups 
are  negligible.  From  the  figure  it  is  clear  that  the  ratings  of  the  two  groups  have  a  high  positive  correlation; 
they  vary  together.  The  product  moment  correlation  coefficient  between  the  two  groups  on  the  7  distortion 
estimates  is  +.9557,  bearing  out  this  expectation. 
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AVERAGE  DISTORTION  RATINGS  FOR  PILOTS  AND  VISUAL  QUALITY  INSPECTORS 


AMOUNT  OF  DISTORTION 


Moderate  2 


TYPE  OF  DISTORTION  RATING 


Figure  5.  Average  Distortion  Ratings  of  6  Pilots  and  6  Visual  Quality  Inspectors  on  11  F-lll  Windscreens. 
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FREQUENCIES  OF  USE  OF  RATING  CATEGORIES:  (A)  BY  TYPE  OF  DISTORTION 


CATEGORIES  OF  DISTORTION  USED  BY  PILOTS  AND  INSPECTORS 


In  the  pTevious  paragraph  it  was  shown  that  pilots  and  visual  quality  inspectors  were  in  close  agreement  on 
the  average  amount  of  each  type  of  windscreen  distortion.  There  were  five  categories  of  amount  of  distortion, 
ranging  from  none  to  extreme.  The  total  number  of  responses  by  each  group  in  each  distortion  category  is 
given  in  table  10  and  is  shown  in  figure  6.  It  is  apparent  from  the  figure  that  visual  quality  inspectors  tend  to 
use  the  “None”  category  more  often  than  the  pilots  did,  with  a  consequent  tendency  to  use  the  “Little” 
category  less  often.  Agreement  is  close  on  the  “Moderate”  and  “Large”  distortion  categories  and  the 
magnitude  of  the  difference  in  the  “Extreme”  category  is  small.  The  frequency  of  category  use  may  be 
arranged  in  the  form  of  a  contingency  table  showing  type-of-rater  by  amount-of-distortion.  This  is  done  in 
table  11. 

TABLE  11 

NUMBER  OF  RESPONSES  BY  DISTORTION  CATEGORY 
FOR  PILOTS  AND  FOR  VISUAL  QUALITY  INSPECTORS 


=.290,  etc. 

One  pilot  omitted  the  banding  rating  for  windscreen 
A,  and  the  rating  for  shimmer  on  H,  J  and  L.  One 
VQI  omitted  banding  on  H  and  blur  on  M,  a  second 
VQI  omitted  bending  on  B,  and  a  third  VQI  was  not 
available  to  rate  E  on  anything,  so  the  grand  sum 
is  910  instead  of  924. 

The  data  in  the  table  yeild  a  value  of  chi  square  of  11.25,  with  a  probability  of  less  than  .05.  Of  the  total  of 
11.25, 8.77  is  contributed  by  the  “None”  and  “Little”  categories.  It  must  be  concluded  that  the  use  frequencies 
by  categories  of  the  two  groups  are  significantly  different,  with  most  of  the  difference  being  in  the  two  lowest 
distortion  categories,  “None”  and  “Little.”  Due  to  the  very  low  usage  frequencies,  little  or  no  significance 
should  be  attributed  to  the  P  values  for  the  “Extreme”  distortion  category.  Note  that  the  probability  that  a 
pilot  will  not  give  an  extreme  rating  is  .9987(1-.013),  and  the  probability  that  a  VQI  will  not  is  .9998(l-.002), 
the  difference  between  the  two  being  trivial. 

INDIVIDUAL  WINDSCREEN  RATINGS  BY  PILOTS  AND  VQI 

Table  12  gives  distortion  data  for  individual  windscreens.  The  means  given  in  the  last  two  columns  are 
plotted  in  figure  7,  with  windscreens  arranged  from  left  to  right  in  order  of  increasing  distortions  as  rated  by 
VQI.  Note  that  9  out  of  11  windscreens  receiver  lower  distortion  ratings  from  VQI  than  from  pilots.  Figure  7 
also  shows:  (1)  the  two  windscreens  with  the  largest  rating  difference  are  actually  only  a  fraction  of  a 
distortion  category  apart,  and  (2)  no  trend  is  obvious  for  the  differences  in  ratings  to  be  related  to  amount  of 
distortion  as  perceptually  judged.  Thus,  agreement  is  not  closer  for  low  distortion  windscreens  than  for  high 
distortion  ones,  or  the  reverse,  etc. 

A  more  sensitive  examination  for  a  differential  trend  may  be  made  by  estimating  the  distortion  of  each 
windscreen  as  the  average  of  the  two  groups,  and  plotting  against  this  the  distortion  ratings  of  the  two 
groups.  This  is  done  because  no  satisfactory  criterion  of  absolute  amount  of  distortion  is  available.  Even 
though  such  a  procedure  verges  upon  “bootstrapping,”  the  results  can  be  illuminating.  Figure  8  plots  the 
distortion  ratings  of  pilots  and  VQI  against  the  group  means  used  as  a  measure  of  amount  of  distortion.  The 
least-squares  best  fitting  straight  lines  are  shown.  If  both  groups  were  the  same,  the  equations  would  both  be 
Y  *  X.  Note  that  neither  is  very  far  from  this.  The  lines  are  close  together  and  do  not  depart  much  from  being 
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NUMBER  OF  RESPONSES  IN  CATEGORY  OR  INTERVAL 


None  Little  Moderate  Large  Extreme 

Distortion  Category 


Figure  6.  Reponse  frequencies  in  various  intervals  of  the  distortion  scale  for  pilots  and  visual  quality 
inspectors. 
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Figure  7.  Average  Dietortion  Rating!  of  11  F-lll  Windscreens  for  Pilots  and  for  Visual  Quality  Inspectors. 
Windscreens  are  Arranged  in  Order  of  Increasing  Distortion  as  Rated  by  VQI. 


DISTORTION 


Figure  8.  Distortion  ratings  for  pilots  and  inspectors  plotted  against  amount  of  distortion  estimated  as  the 
pilot-inspector  average. 


parallel.  The  slight  tendency  for  the  curves  to  converge  (to  indicate  closer  agreement)  at  higher  levels  of 
distortion  is  too  small  to  be  reliable.  Over  the  restricted  range  of  distortion  afforded  by  the  11  F-lll 
windscreens,  no  reliable  evidence  indicates  any  change  in  amount  of  agreement  with  amount  of  distortion 
as  perceptually  judged.  In  particular,  VQI  do  not  become  more  lenient  or  less  critical  as  compared  to  pilots  at 
higher  distortion  levels.  They  are  only  slightly  less  critical  than  pilots  at  all  levels  that  were  examined  in  the 
present  study. 

PILOTS  AND  INSPECTORS  ON  INFLUENCE  AND  ACCEPTABILITY 

The  numbers  of  pilots  and  inspectors  who  believe  that  each  windscreen  will  or  will  not  influence  pilot 
performance  and  is  or  is  not  acceptable  are  given  in  table  13.  The  bottom  line  of  the  table  gives  percentage 
values.  Note  that  59%  of  the  pilots  said  that  flying  performance  would  be  affected  while  77%  said  yes  on 
acceptability.  This  may  be  contrasted  with  the  15%  of  visual  quality  inspectors  on  influence  and  92%  on 
acceptability.  It  is  clear  that  pilots  were  more  likely  than  VQI  to  believe  that  the  distortion  that  was  present 
would  influence  flying  performance  and  were  also  more  likely  than  VQI  to  give  an  unacceptable  rating  to  a 
windscreen.  The  VQI  were  less  critical  than  pilots  in  judging  both  influence  and  acceptability. 

Tables  14  and  15,  set  up  as  contingency  tables,  contain  the  column  totals  for  the  frequencies  in  table  13.  The 
values  of  chi  square  calculated  from  the  numbers  in  these  tables  were  24.88  (P<.01)  for  influence  on  flying 
performance  and  4.79  (P<.05)  for  windscreen  acceptability.  The  finding  that  inspectors  were  less  critical 
than  pilots  in  judging  both  the  effects  of  distortion  on  flying  performance  and  windscreen  acceptability  is 
thus  supported  by  statistical  significance  tests. 
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Significantly  more  pilots  than  Significantly  more  pilots  than 

VQI  reject  windscreens.  VQX  believe  performance  will  be 

affected . 


SECTION  IV 

DISCUSSION  OF  RESULTS 


In  the  present  study  the  observers  were  not  trained  to  make  distinctions  between  types  of  distortion  nor  were 
types  of  distortion  defined  or  described.  It  is  likely  that  clear-cut  distinctions  were  not  always  drawn  by 
observers  between  types  of  distortion.  Similarly,  no  training  was  given  on  the  meaning  of  terms  such  as 
“Moderate  Distortion.”  Some  observers  may  have  tried  to  make  absolute  judgements  while  others  may  have 
been  influenced  by  the  distortion  levels  of  the  particular  set  of  11  windscreens.  They  were  already  familiar 
with  them  from  participating  in  a  separate  study. 

The  above  lack  of  refinement  or  sophistication  is  due  in  part  to  the  lack  of  a  set  of  standards  that  would 
clearly  distinguish  both  types  and  amounts  of  windscreen  optical  distortion.  For  types  of  distortion  that  do 
not  require  motion  to  detect,  the  standards  could  be  a  set  of  photographs  made  by  photographing  a  grid 
board  through  a  distorting  panel  or  computer  processing  an  undistorted  photograph  to  obtain  precisely 
controlled  types  and  amounts  of  distortion.  For  a  dynamic  type  of  distortion,  Buch  as  shimmer,  a  set  of 
three-dimensional  holograms  would  probably  be  required. 

In  the  present  study  there  were  high  correlations  between  types  of  distortion.  To  what  extent  are  these 
relationships  or  correlations  generalizable?  This  question  cannot  be  answered  because  the  correlations 
could  have  been  the  result  of  such  things  as  (1)  confusion  in  observers  on  distinctions  between  types  of 
distortion,  (2)  peculiarities  of  the  set  of  11  F-lll  windscreens  due  to  materials  or  manufacturing  processes.  It 
is  hypothesized  that  even  extensive  observer  training  with  a  set  of  excellent  optical  distortion  standards 
would  have  little,  if  any,  effect  upon  reducing  the  high  correlations  between  types  of  distortion.  Types  of 
distortion  readily  definable  and  commonly  regarded  in  theory  to  be  distinct  or  separate  may  not  be 
independent  in  realizable  windscreens. 

Aircraft  pilots  and  visual  quality  inspectors  were  quite  close  in  their  estimates  of  both  overall  optical 
distortion  and  specific  types  of  distortion.  While  pilots  rated  distortions  slightly  higher  than  did  the 
inspectors,  the  differences  were  inconsequential.  The  equivalence  of  the  ratings,  particularly  at  higher 
levels  of  distortion  where  differences  could  have  important  practical  consequences,  is  encouraging.  It  is 
probably  safe  to  conclude  that,  in  general,  subjective  estimates  of  amount  of  distortion  by  visual  quality 
inspectors  and  by  aircraft  pilots  will  not  differ  by  much. 

The  equivalence  in  estimations  of  amount  of  distortion  is  not  matched  by  equivalence  in  judging  whether  or 
not  enough  distortion  is  present  to  influence  flying  performance.  Pilots  were  appreciably  more  likely  to  rate 
a  windshield  as  likely  to  influence  performance.  Actually,  neither  the  C-141  pilots  nor  the  visual  quality 
inspectors  have  had  any  experience  with,  or  even  seen  data  on,  situations  in  which  flying  performance  was 
judged  or  measured  when  optical  distortion  levels  were  varied.  ThuB,  it  cannot  be  said  that  the  pilots  were  too 
severe  in  their  judgment  of  probable  influence  on  flying  or  that  the  inspectors  were  too  lenient. 

Windscreens  that  do  not  significantly  degrade  pilot  performance  are  required.  However,  it  is  not  known  just 
how  high  optical  quality  might  be  before  additional  quality  is  not  accompanied  by  signficant  increases  in 
the  performance  of  aircraft  pilots.  Very  high  optical  quality  is  both  extremely  expensive  and  unnecessary.  A 
level  of  optical  quality  can  be  found  that  is  entirely  adequate  and  is  financially  affordable.  What  the  Air 
Force  needs  now  are  valid  specifications  of  windscreen  optical  characteristics  and  inspection  methods  that 
will  insure  that  specifications  are  met.  The  present  paper  has  identified  several  optical  characteristics  that 
appear  to  pilots  and  windscreen  inspectors  to  be  relevant  to  pilot  performance.  Further  research  on  pilot 
requirements  and  on  objective  and  subjective  methods  for  specifying  and  measuring  windscreen  optical 
qualities  are  needed. 
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